Progressive multi-scale fusion network for RGB-D salient object detection

نویسندگان

چکیده

Salient object detection (SOD) aims at locating the most significant within a given image. In recent years, great progress has been made in applying SOD on many vision tasks. The depth map could provide additional spatial prior and boundary cues to boost performance. Combining information with image data obtained from standard visual cameras widely used works, however, introducing suboptimal fusion strategy may have negative influence performance of SOD. this paper, we discuss about advantages so-called progressive multi-scale method propose mask-guided feature aggregation module (MGFA). proposed framework can effectively combine two features different modalities and, furthermore, alleviate impact erroneous features, which are inevitably caused by variation quality. We further introduce refinement (MGRM) complement high-level semantic reduce irrelevant fusion, leading an overall detection. Experiments five challenging benchmarks demonstrate that outperforms 11 state-of-the-art methods under evaluation metrics. • Novel structure for RGB-D saliency Mask-Guided Feature Aggregation filtering noise data. Refinement Module RGB Progressive deep shallow layers. Achieve competitive compared prevalent methods.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MSDNN: Multi-Scale Deep Neural Network for Salient Object Detection

Salient object detection is a fundamental problem and has been received a great deal of attentions in computer vision. Recently deep learning model became a powerful tool for image feature extraction. In this paper, we propose a multi-scale deep neural network (MSDNN) for salient object detection. The proposed model first extracts global high-level features and context information over the whol...

متن کامل

Local Background Enclosure for RGB-D Salient Object Detection - Supplementary Results

The purpose of this supplementary material is to examine in detail the contributions of our proposed Local Background Enclosure (LBE) feature. A comparison of LBE with the contrast based depth features used in state-of-the-art salient object detection systems is presented. The LBE feature is compared with the raw depth features ACSD [1], DC [3] and a signed version of DC denoted SDC on the RGBD...

متن کامل

Multi-Scale, Categorical Object Detection and Pose Estimation using Hough Forest in RGB-D Images

Autonomous Intelligent Systems Institute for Computer Science Master of Science Multi-Scale, Categorical Object Detection and Pose Estimation using Hough Forest in RGB-D Images by Ishrat Badami Classification and localization of objects enables a robot to plan and execute tasks in unstructured environments. Much work on the detection and pose estimation of objects in the robotics context focuse...

متن کامل

RGB-D Salient Object Detection Based on Discriminative Cross-modal Transfer Learning

In this work, we propose to utilize Convolutional Neural Networks (CNNs) to boost the performance of depth-induced salient object detection by capturing the high-level representative features for depth modality. We formulate the depth-induced saliency detection as a CNN-based cross-modal transfer problem to bridge the gap between the " data-hungry " nature of CNNs and the unavailability of suff...

متن کامل

Learning Graph Matching for Object Detection from Rgb-d Images

We propose an optimization method for estimating parameters in graph-theoretical formulations of the matching problem for object detection. Unlike several methods which optimize parameters for graph matching in a way to promote correct correspondences and to restrict wrong ones, our approach aims at improving performance in the more general task of object detection. In our formulation, similari...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computer Vision and Image Understanding

سال: 2022

ISSN: ['1090-235X', '1077-3142']

DOI: https://doi.org/10.1016/j.cviu.2022.103529